# Dense prediction

Dpt Swinv2 Base 384

A DPT (Dense Prediction Transformer) model trained on 1.4 million images for monocular depth estimation. It uses Swinv2 as its backbone network and is suited to high-precision depth prediction tasks.

Dpt Dinov2 Small Kitti

A DPT model that uses DINOv2 as its backbone for depth estimation tasks.

Dpt Hybrid Midas

A monocular depth estimation model based on the Vision Transformer (ViT), trained on 1.4 million images.
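
As a rough illustration of how one of the depth estimation models above might be used, the sketch below runs a single image through the Hugging Face `transformers` depth-estimation pipeline. This page does not give repository names, so the Hub IDs in `ASSUMED_HUB_IDS` are assumptions, and the helper `estimate_depth` is a hypothetical name introduced only for this example.

```python
# Assumed Hugging Face Hub IDs for the listed models (not confirmed by this page).
ASSUMED_HUB_IDS = {
    "Dpt Swinv2 Base 384": "Intel/dpt-swinv2-base-384",
    "Dpt Dinov2 Small Kitti": "facebook/dpt-dinov2-small-kitti",
    "Dpt Hybrid Midas": "Intel/dpt-hybrid-midas",
}


def estimate_depth(image_path, listing_name="Dpt Hybrid Midas"):
    """Hypothetical helper: predict a depth map for one image file."""
    # Imported lazily so the mapping above is usable without the heavy dependencies.
    from PIL import Image
    from transformers import pipeline

    estimator = pipeline("depth-estimation", model=ASSUMED_HUB_IDS[listing_name])
    result = estimator(Image.open(image_path))
    # For a single input the pipeline returns a dict; "depth" is a PIL image
    # of the predicted depth map, "predicted_depth" is the raw tensor.
    return result["depth"]
```

A call such as `estimate_depth("room.jpg").save("room_depth.png")` would write the predicted depth map to disk; both file names are placeholders. The first call downloads the model weights, so it needs network access and the `transformers`, `torch`, and `Pillow` packages.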

This is a Dense Prediction Transformer (DPT) model fine-tuned on the ADE20k dataset for semantic segmentation tasks.

Image Segmentation
